MICROCOPY  RESOLUTION  TEST  CHART 

NATIONAL  BURtAU  Of  STANDARDS  )  963  A 


AD- A 189  144 


SECURITY  CLASSIFICATION  OF  THIS  PACE  flWiwl  Data  BnlararfJ 


REPORT  DOCUMENTATION  PAGE 


t.  REPORT  NUMBER  z.  GOVT  ACCESSION  NO. 

293 


4.  TITLE  fanrf  Subtill.) 

The  Parameterization  of  Orthogonal  Matrices: 
A  Review  Mainly  for  Statisticians. 


© 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  3ft)RM 


1  RECIPIENT'S  CATALOG  NUMBER 


«.  PERFORMING  ORG.  REPORT  NUMBER 


7.  AUTMORfa; 

I.  J.  Good  and  A.  I.  Khuri 


».  PERFORMING  ORGANIZATION  NAME  ANO  ADDRESS 

Department  of  Statistics 

Nuclear  Sciences  Center,  University  of  Florida 
Gainesville,  FL  32611 


II  CONTROLLING  OFFICE  NAME  ANO  ADORESS 

Office  of  Naval  Research 

Mathematical  Sciences  Division  (Code  411) 

Arlington,  VA  22217-5000 


4  MONITORING  AGENCY  NAME  4  AOORESS (It  dlltoionl  hoot  Controlling  Olllco) 


•  CONTRACT  OR  GRANT  NUMBERfa) 

N00014-86-K-0059 

R&T  4114552 - 01 

Acct .  No.  49101623459 


10.  PROGRAM  ELEMENT.  PROJECT.  TASK 
AREA  •  WORK  UNIT  NUMBERS 


12  REPORT  DATE 

November  1987 


13  NUMBER  OF  PAGES 

17 


IS.  security  Class.  (ot  thlt  raportj 


Unclassified 


<Sa  DECLASSIFICATION.  DOWNGRADING 
SCHEDULE 


17.  DISTRIBUTION  STATEMENT  (ol  (Jia  mbmlrocl  mot. rod  In  Block  10.  II  dltlorm nl  Iron  K.port) 


■■  &  *  r— • 

'.v_eg  r£ 

•  ’  DEC  0  ? 


ie.  supplementary  notes 

19.  KEY  IfO^OS  (Contlnuo  on  rovoram  •/</•  It  nacaaamry  mid  Idontliy  by  block  numbar) 

■")  Cayley^s  transformation;  Diagonalization,  simultaneous; 
Haar  measure;  Orthogonal  group;  Rotation  groups.^- 

Eulerian  angles; 

20.  ABSTRACT  ( Contlnu #  on  ravait*  oldo  If  noc ooomry  mnd  Identity  by  block  numb  \r) 

Nj. 

This  paper  reviews  methods  for  the  parameterization  of  orthogonal  matrices. 
Examples  of  where  such  parameterization  can  be  used  are  presented. 


00  1473  COITION  OF  1  NOV  EE  IS  OBSOLETE 

S  N  010?-  IF-  014-  6601 


_ Unclassified 

SECURITY  CLASSIFICATION  OF  THIS  PAGE  (Whon  D ala  tnloroO) 


THE  PARAMETERIZATION  OF  ORTHOGONAL  MATRICES: 
A  REVIEW  MAINLY  FOR  STATISTICIANS 


By 


I.  J.  Good 

Department  of  Statistics 
Virginia  Polytechnic  Institute 
and  State  University 
Blacksburg,  VA  24060 


Technical  Report  Number  293 

Department  of  Statistics 
University  of  Florida 
Gainesville,  FL  32611 

November  1987 


Andre  I.  Khuri 
Department  of  Statistics 
University  of  Florida 
Gainesville,  FL  32611 


•  1987.  November- 2  » 


THE  PARAMETERIZATION  OP  ORTHOGONAL  MATRICES: 
A  REVIEW  MAINLY  POR  STATISTICIAN  S x 


By  I.  J.  Good  and  Andre  I.  Khuri 

Virginia  Polytechnic  Institute  a  State  University  and  University  of  Florida 


Abstract 


Techniques  and  applications  for  the  parameterization  of  orthogonal  matrices 
surveyed,  mainly  for  the  benefit  at  statisticians. 


1.  introduction. 

By  an  orthogonal  matrix  C  we  shall  mean  a  real  square  matrix  of  order  n  x  n 


for  which  CC'  is  the  identity 


,  where  C*  denotes  the  transpose  of  C. 


Orthogonal  matrices  are  used  frequently  in  statistics,  especially  in  linear  models 
and  multivariate  analysis  (see,  for  example,  Graybill,  1961,  Chap.  11;  Searle, 
1971,  Chap.  2;  Anderson,  1965;  and  James,  1954.). 

Orthogonal  matrices  of  determinant  1  represent  elements  of  the  rotation  group 
in  n  dimensions.  Gel'fand  at  al.  (1963)  and  Mumaghan  (1938)  give  extensive 
discussions  of  the  representation  of  the  n -dimensional  rotation  group.  Hoffman  et 
al.  (1972)  and  RaffenetU  and  Ruedenberg  (1970)  represented  the  class  at 
n-dimensional  orthogonal  matrices  in  terms  of  generalized  Eulerian  angles.  The 
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representation  of  three-dimensional  orthogonal  matrices  by  EUiettatt  fthftM  hk£-lon 
been  used  in  the  study  of  the  motion  of  a  rigid  body  (Euler,  1776,  Whittaker, 
1927,  pp.  9-10).  For  further  ideas  and  a  three-dimensional  application,  see 
Moran  (1975). 

The  n*  elements  of  an  orthogonal  matrix  are  subject  to  %n(n+l)  constraints, 
it  is  therefore  not  surprising  that  they  can  be  represented  by  only  n*  -  %n(n+l)  * 
ijn(n-l)  independent  parameters.  A  representation  is  convenient  if  the  whole  of  a 
matrix  can  be  quickly  computed  from  the  fcn(n-l)  parameters.  Such  a  representation 
the  search  for  an  orthogonal  matrix  that  satisfies  some  optimality 
criterion.  The  independent  parameters  can  be  used  also  in  integrating  a  function 
over  the  n-dimensional  orthogonal  group  (Murnaghan,  1938,  pp.  230-242),  which  is 
defined  as  the  set  of  all  n  x  n  orthogonal  matrices  with  the  operation  of  matrix 
*  multiplication. 

As  far  as  we  know,  the  various  methods  of  parameterizing  orthogonal  matrices 
are  not  available  in  the  statistical  literature.  Our  aim  is  to  bring  several  of  these 
methods  to  the  attention  of  the  statistical  profession,  and  we  do  not  claim 
mathematical  originality,  although  probably  a  few  of  our  comments  have  some 
novelty. 

First  we  mention  some  applications.  The  cited  references  may  be  consulted 
for  more  details.  The  rotation  group  of  n  x  n  orthogonal  matrices  will  be  denoted 
by  0(n). 

2.  some  Applications  in  Statistics  of  Parameterization  of  the  Orthosop**  Group. 

(a)  A  parameterization  of  0(n)  can  be  used  to  define  an  invariant  (Baar) 
measure,  ft,  on  0(n).  This  measure  is  invariant  in  the  sense  that  if  a  set  G  of 
orthogonal  n  by  n  matrices  has  measure  ji( G)  then,  for  every  orthogonal  n  by  n 
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matrix  C,  u(G)  =  u(GC)  =  m(£G)  and  Og  [and  £Gj  denoted  Us*  S*i  6f  featrice# 
obtained  by  multiplying  all  the  matrices  in  G  on  the  right  [left]  by  C.  Baa r 
measure  is  unique  up  to  a  positive  multiplicative  constant.  Zn  accordance,  for 
example,  with  James  (1954,  p.  53),  the  Haar  measure  m(G),  if  it  exists,  is  given 
by  the  ifn(n-l)  -fold  integral 

.  l<Kj<n 

ii(G)  ■  [  n  »  6  c  °<“> 

i.j 

where  jpj  and  Cj  are  the  i**1  and  j**1  column  vectors  of  an  orthogonal  matrix  C,  (ej  * 
dcj)  denotes  a  product,  and  the  product  of  differential  forms  is  to  be 

interpreted  as  an  exterior  product.  (Exterior  products  are  explained,  for 
example,  by  James,  1954,  p.  46.)  For  example,  if  n  *  2,  the  differential  form, 
n(c[  •  dcj),  for 


C 


COS  9 
-sin  9 


sin  9 
cos  e 


0  4  0  <  2rr  , 


is  (gi  •  dc*)  »  d9. 

The  Haar  measure  has  been  used  to  derive  the  distrihuHon  of  the  canonical 
correlation  coefficients  (James,  1954).  Anderson  (1965)  used  the  representation 
given  by  formula  (3.2)  of  Section  3  below  to  obtain  the  joint  distribution  of  the 
eigenvalues  of  the  sample  variance -covariance  matrix.  See  also,  for  example, 
wijsman  (1957),  Chattopadhyay  et  al.  (1976). 

(b)  When  carrying  out  simulation  experiments  in  regression  theory,  the 
condition  number  of  Z’Z  is  of  interest,  where  Z  is  the  matrix  associated  with  the 
regression  model.  This  condition  number  is  defined  (Hartree,  1955,  p.  153)  as 
the  ratio  of  the  Largest  to  the  smallest  of  the  eigenvalues  of  Z'Z.  George  Terrell 
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pointed  out  ( private  communication )  that  by  applying  random  orthogonal 
transformations  to  xf  we  can  generate  several  different  X  matrices  for  which  the 
condition  numbers  are  all  equal.  This  would  be  convenient  in  the  simulation 
experiments.  The  generation  of  such  random  orthogonal  transformations  can  be 
facilitated  by  utilizing  the  independent  parameters  obtained  through  parameterization 
of  the  orthogonal  matrices  representing  these  transformations. 

(c)  The  simultaneous  diagonal! ration  of  several  matrices  is  of  considerable 
interest  in  statistics,  especially  in  the  analysis  of  random  -  or  mixed -effects 
models.  Let  At,  Az,  ...  ,  Aj  be  1  (>  2)  positive  definite  symmetric  matrices  of 
order  p-  x  p.  Suppose  that  the  as  matrices  are  not  simultaneously  diagonalizable 
(because  they  do  not  commute),  and  we  wish  to  find  an  orthogonal  matrix,  J»,  which 
makes  them  simultaneously  "as  diagonal  as  passible",  nury  and  GautscM  (19tyS,  p. 
170)  introduced  a  measure  of  simultaneous  deviation  of  the  matrices,  BAXB',  BAZB* , 
...  .  from  dia tonality.  This  measure  is  given  by  the  function 

X(B»  jjt  *  Az »  •  •  •  »  jjl  (  ,  nx ,  ...  ,  D| )  * 

* 

n  (  |diag(BAjB>'  )  J/ I  }”*•  ,  (2.1) 

i»l 

where  diag(^BA^ST)  is  a  diagonal  matrix  having  the  diagonal  elements  as  BA^B* 

(i  »  1,  2,  ...  ,  I),  I • )  denotes  the  determinant  of  a  matrix,  and  nx,  nz,  ...  , 
ng  are  positive  weights,  it  is  known  that  x  >  1  with  equality  occurring  if  and  only 
if  JB  diagonalizes  then's,  it  is  therefore  of  interest  to  find  the  minimum  of  x  over 
the  rotation  group  O(p). 

The  determination  of  the  optimal  orthogonal  matrix,  B0,  that  minimizes  x  to 
(2.1)  can  be  reduced  to  an  optimization  problem  in  ^p(p-i)  dimensions  since  this 


is  ‘the  number  of  independent  parameters  that  represent  the  elements  of  O(P). 
Plury  and  Gautschx  (1986)  did  not  follow  this  procedure;  instead,  they  introduced 
an  iterative  algorithm  whereby  a  converging  sequence  of  orthogonal  matrices, 

jg<  -*■  > ,  ...  ,  was  derived  such  that  x(B^^+1^)  <  j  *  0,  1,  2 .  All 

pairs  of  columns  of  the  orthogonal  matrix  obtained  in  the  iteration  (j  3  0, 

1,  2,  ...)  are  subjected  to  rotations  by  a  specific  2x2  orthogonal  matrix 
parameterized  by  a  single  parameter.  This  yields  the  matrix  and  the 

process  is  repeated  until  some  convergence  criterion  is  met. 

(d)  The  need  for  parameterization  arises  in  response  surface  analysis. 
Consider  a  linear  model  of  the  form 

J  “32.  +  t  *  (2.2) 

where  JT  is  the  vector  of  observations,  0  is  a  vector  of  unknown  parameters,  and  X 
is  a  known  matrix  of  order  n  x  p  and  rank  p.  The  elements  of  X  are  functions  of 
the  settings  of  m  input  variables  denoted  by  xx,  xz,  ...  ,  xm.  The  n  x  m  matrix, 
B  a  {Xyi),  where  Xyj  is  the  setting  of  the  1th  variable  at  the  u**1  experimental  run 
(u  =  1,  2,  ...  ,  n;  j  =  1,  2,  ...  ,  m),  is  called  the  design  matrix.  The 
elements  of  the  error  vector  in  (2.2)  are  assumed  to  be  independently  and 
identically  distributed  random  variables  with  zero  means  and  variances  equal  to  az. 

Tests  of  significance  concerning  the  parameter  vector  &  in  (2.2)  depend  on 
the  assumption  that  js  is  normally  distributed.  The  effect  of  nonnormality  of  the 
error  distribution  on  these  tests  has  been  studied  by  several  authors  (see,  for 
example,  Pearson,  1931;  Geary,  1947;  Gayen,  1990;  and  David  and  Johnson, 
1951a,  1951b).  Box  and  Watson  (1962)  pointed  out  that  the  sensitivity  to 
nomormality  depends  very  much  on  the  settings  of  the  input  variables  specified  in 
the  design  matrix.  This  was  also  demonstrated  by  Vuchkov  and  SolaJtov  (1980). 
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Thus,  a  properly  chosen  design  matfiir  ean  iea4  Ur  Ini*  that  sr#  resistant,  or 
robust,  to  failure  of  the  normality  assumption. 

Box  and  Watson  (1962)  introduced  a  design  robustness  criterion  for  a  model 
of  the  form  (2.2)  of  degree  one  (a  first-order  model),  which  can  be  rewritten  as 

j  -  Wo  +  Of  +  ,  (2.3) 

where  1  is  a  column  vector  of  ones  of  order  n  x  l,  o  is  the  design  matrix  such 

fees  ^ 

that  X  a  pLiiy ,  and  (0O,  r*)‘  =»  js.  Bax  and  Watson's  criterion  is  for  testing  the 
hypothesis 

Ho*  r  a  0  (2.4) 

a  •* 

using  the  mean  square  ratio 

*<£)  *  (I  “  yx'*)'Ax'>r/(n-»-i>i  • 

They  showed  that  in  nonnormal  situations,  F(JD)  is  distributed  approximately  under 
B0  as  an  P -distribution  with  modified  degrees  of  freedom  given  by  a  nm  and 
=  n(n-m-l) .  Here  the  corrective  factor  t)  is  equal  to  unity  if  ((D)  s  0  regardless 
of  the  y't  or  th#ir  distribution,  whsrs 

g(D)  a  d  -  m(m+2)( n-l)/[n( n+1))  ,  (2.S) 

m 

and  dag  df*  with  dj*  being  the  1th  diagonal  element  of  t^D'D)*^' . 

iai  ~  ~  ~  “ 

Since  i)  is  equal  to  unity  when  the  error  distribution  is  normal,  a  design 
matrix  satisfying 

*<D)  ■  0  (2.6) 

will  result  in  an  approximate  p  statistic  with  degrees  of  freedom  identi- 
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cal  to  those  obtained  under  normality;  Consequently ,  the  design  BfttriX 
can  determine  whether  the  distribution  of  P(JD)  is  insensitive  (robust) 
to  nonnormality.  The  matrix  ,  being  idempotent,  can  be  ex¬ 

pressed  in  the  form 

0(D'D)‘xD’  =  P  diag(lm,  o)P*  , 

where JP  is  an  orthogonal  matrix.  In  this  way,  the  parameterization  of 
_D( D’D)~ Mv  can  be  reduced  to  that  of  an  orthogonal  matrix.  The  quantity, 
d,  in  (2.5)  can  be  regarded  as  a  function,  h(P),  of  F  and  (2.5)  becomes 

g(JB)  »  W  *  m( mf  2 )( n-1  )/[n( n+1 ) ]  .  (2.7) 

For  a  fixed  n,  one  method  to  implement  the  Box -Watson  criterion  is  to  find  P  so 
that  gz(JB)  has  an  absolute  minimum  over  the  rotation  group  O(n).  If  this  absolute 
minimum  is  zero,  then  the  Box -Watson  criterion  is  satisfied.  Otherwise,  the  optimal 
design  that  minimizes  g*(.D)  is  the  design  that  comes  "closest"  to  satisfying  this 
criterion.  For  more  details  concerning  this  matter  see  Khuri  and  Myers  (1981). 

The  minimization  process  can  be  carried  out  by  expressing  h(F)  in  (2.7)  in 
terms  of  the  parameters  of  the  orthogonal  matrix  P. 

We  now  describe  various  methods  of  parameterization. 

3.  How  to  Parameterize  an  Orthogonal  Matrix. 

The  four  methods  we  shall  discuss  are  (i)  expressing  an  orthogonal  matrix  in 
the  form  eX  where  T  is  skew -symmetric,  (ii)  using  Cayley's  transformation  which 
also  gives  an  expression  in  terms  of  a  skew -symmetric  matrix,  (iii)  using  Eulerian 
angles  for  the  three-dimensional  case,  and  (iv)  using  generalized  Eulerian  angles 
for  the  n -dimensional  case. 

(i)  If  Q  is  an  n  x  n  orthogonal  matrix  with  determinant  l,  then  it  can  be 
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written  in  the  form 


J9  -  e*  »  (3.1) 

where  T  is  a  skew -symmetric  matrix  (see,  for  example,  Gantmacher,  1939,  p. 

MM 

208).  (The  exponential  is  of  course  definable  as  an  infinite  series.)  The 
elements  of  T,  above  its  main  dtafonal,  can  be  used  to  parameterize  jQ. 

Now,  if  Q^is  given,  then  to  find  T  we  can  first  find  the  eigenvalues  of  g. 
These  axe  necessarily  of  the  form  e***1,  e ***«,  ...  #  e*i*tl,  1  ,  where  the 
eigenvalue  1  is  of  multiplicity  (n  -  2q),  and  none  of  the  real  mashers  Oj 
is  a  multiple  of  2ir  (J  ■  1,  2,  ...  ,  q).  (Those  that  are  odd  multiples 
of  ir  give  an  oven  number  of  eigenvalues  equal  to  -1. )  If  we  denote  the 
matrix  |  ^  j  by  (a  +  bi],  a  notation  that  is  reasonable  because  the 

2x2  matrices  form  a  representation  of  complex  numbers,  then  Q  can  be 
written  (Gantmacher,  1939,  p.  208),  as  the  product  of  three  real  matrices, 

^9  a  Pi  diag{(e  *■],  ...  ,  [e  ,  1,  ...  ,  1)Q^  .  (3.2) 

Bare  Qx  is  an  orthogonal  matrix  of  the  form 

=  2i»  5*'  2*'  *••  » J5q*  .Zq*  JJxq+r*  •••  » 

such  that^Xj  +  iyj  is  an  eigenvector  with  eigenvalue  e**3  O  -  l.  ...  .  q). 
and jifc  is  an  eigenvector  with  eigenvalue  1  (X  =  2q  +  1,  ...  ,  n). 

We  now  define  the  sXew-symetric  matrix  T  by  the  equation 

V  mjJj,  diag {[i*x],  ...  ,  [ibql,  0,  ...  ,  0)Q^  .  (3.3) 

Since  e^**^  =  [s*^]  we  have  Q  =  e^a s  required. 


we  note  that 


q-1 

-  n  * j 

j=0 

where 

JRj  ajj,.  diaggzj,  [e1**^].  in-aj-aJQl  ,  j  *  0,  ,  q-1  , 

and  where  JEzj  and  are  the  identity  matrices  of  dimensions  2j  mid  n-2j-2 , 

respectively.  If  nj+i  is  a  plane  parallel  to  both  the  vectors  and  jfj+i »  then 
represents  a  rotation  of  the  n- space  leaving  fixed  every  point  in  the 
(n-2) -space  orthogonal  to  ( Vitali,  1928).  Thus  q  represents  the  product  of  q 
such  simple  rotations.  When  n  »  3  and  q  ■  1,  this  is  a  f— <n»r  fact  in  dynamics, 
known  to  Euler.  TO  obtain  a  power  Q®  we  could  multiply  all  the  angles  of  rotation 
by  a. 

We  note  also  that 


T(xj  +  iyj)  =»  +  iyj)  .  1  *  i.  z . .  . 

3JK  *  °  .  X  «=  2q+l . n  . 


(3.4) 


Equations  (3.4)  state  that  jjj  +  iyj  is  an  eigenvector  of  jr  with  eigenvalue  i<tj,  and 
that  *|{  is  an  eigenvector  of  T  with  zero  eigenvalue.  Since  the  eigenvalues  of  T 
are  purely  imaginary  or  zero  (at  least  one  of  them  is  zero  if  n  is  odd),  and  the 
imaginary  ones  pair  off  in  conjugate  pairs  ±i$A,  ...  ,  *i«q,  it  follows  that  (3.3) 
gives  a  representation  of  a  real  skew -symmetric  matrix  in  terms  of  its  eigenvalues 
and  eigenvectors  (cantmacher,  1959,  p.  285).  This  observation  enables  us  to 
calculate  g  U  £  is  given  and  if  (3.1)  is  known  to  be  true.  We  first  put  T  in  the 
form  (3.3),  then  we  calculate  Q  by  (3.2).  Tor  convenience  in  computing  the 
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eigenvalues  and  eigenvectors  of  ^T,  we  note  that  xj  and  Jfj  (j  -  lt  ...  ,  q)  are 
eigenvectors  of  the  symmetric  matrix  JT2  with  eigenvalue  .  This  can  be  easily 

seen  from  (3.3),  which  when  both  sides  are  squared  gives 


T2  = 


•  z  +JJZ3) 

3=1 


Example,  n  »  3.  In  three  dimensions  any  orthogonal  matrix  Q,  with 
determinant  X,  has  the  form 


5  *  «P 


0 

-c 

b 


c 

0 

-a 


-b 

a 

o 


The  eigenvalues  of 


T  = 


0 

-c 

b 


c 

o 

-a 


-b 

a 

o 


are  ip,  -ip,  and  o.  Where  p  =  (a2  +  b2  +  c2)^*.  The  normalized  eigenvector 
with  eigenvalue  o  is  w  *  p_i[a,  b,  c] ’  and  the  other  normalized  eigen¬ 
vectors  can  be  chosen  as  u  ±  iv  where 


and 


u  =  <b*  +  c2)_!*[0,  c,  -bj* 


v  *  p*1(b2  +  c2)'^[b2  +  c2,  -ab,  -acj‘ 


The  vectors  and  v,  are  orthonormal  eigenvectors  of  JT*  with  the  eigenvalue 


prom  (3.2),  we  have 


-P*. 

Q  *  [u,  v,  w3dia*{[e'ip].l)Cu.  ▼.  »]*  • 

V  Mf  tv  W  W 

The  matrix  Q  represents  a  rotation  of  the  3 -space  about  a  line  whose  di- 

«W 

rection  cosines  are  (a/p,  b/p,  c/p). 

(ii)  if  n  is  an  orthogonal  matrix  that  does  not  have  the  eigenvalue 
•1,  then  it  may  be  written  in  Cayley’s  form  (Gantmacher,  1999,  p.  289). 

Q  -  (I  -  3)<2  +  9 )-*•  (3.5) 

+*  Atm 

where  S  is  a  skew -symmetric  matrix.  [I  +  a  is  never  singular  if  a  is 

mp  W  ter 

skew-symmetric.  A  proof  of  this  fact  is  given  by  Ferrar  (1990,  p.  193)  who 
attributes  the  fact  to  Cayley.  A  simpler  proof  is  that  all  the  eigenvalues  of  a 
skew -symmetric  matrix  a  are  purely  imaginary  or  zero  (because  it  is  Hermitlan)  and 
therefore  -1  cannot  be  an  eigenvalue  of  _S .  ]  The  form  (3.S)  has  the  advantage  of 
defining  Q  as  a  one-to-one  function  of  S,  but  it  has  the  disadvantage  of  being 
restricted  to  theme  orthogonal  matrices  that  do  not  have -the  eigenvalue  -1.  If  this 
condition  is  not  assumed,  £  can  still  be  written  in  the  fora  (Ferrar,  1990,  p. 
194) 

q  *  j(i  -  a)( x  +  a)**-  (3.6) 

V  a#  ^  vu  Ns 

where  J  is  a  diagonal  matrix  in  which  each  element  on  the  diagonal  is  either  1  or 
-1  (and  where  S  is  skew -symmetric ) .  This  representation  can  be  made  unique  by 
insisting  that  all  the  plus  l’s  precede  all  the  minus  l’s  along  the  diagonal  of  J. 
When  this  representation  is  used  we  must  supplement  the  'jn(n-l)  parameters  in  3 
with  an  assumption  for  the  number,  r,  of  plus  l’s  in  J. 
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(iii)  In  three  dimensions  orthogonal  transformations  with  determinant  1  are  of 
coarse  represented  by  the  rotation  of  a  rigid  body  free  to  turn  about  a  point  0. 
The  motion  of  this  body  is  determined  by  three  independent  angles  known  as  Eulerian 
angles  (Euler,  1776;  Whittaker,  1927,  p.  9;  Condon,  1958,  p.  6).  These  angles 
can  be  introduced  in  various  ways.  The  following  one  is  used  by  Condon z 

Let  OXYZ  be  a  right-handed  system  of  rectangular  axes  fixed  in  space.  Let 
Oxyz  be  rectangular  axes  fixed  relatively  to  the  body  and  moving  with  it,  such  that 
before  the  displacement  the  two  sets  of  axes  OXTZ  and  Oxyz  are  coincident  in 
position.  Let  and  Vj  be  unit  vectors  in  the  poeitive  directions  of  the  Z  and  z 
axes,  respectively,  and  let  JC  *  vg  *  vj  b#  their  vector  product.  Denote  the 
angles  ZOz,  xox,  KOx  by  0,  0,  0,  respectively.  These  are  the  three  Eulerian 
angles. 

By  a  remark  made  in  Section  3(i)  the  total  movement  of  the  body  is  equivalent 
bo  a  rotation.  This  rotation  is  represented  by  the  product  of  three  matrices, 
thus: 


Q  *  diag  {(•*'*'].  l}diag{i,  [ei®]}diag(Cei(>],  1) 


coed  cosd  -  sind  cose  Bind 
-cosd  sind  -  sind  cose  cosd 
Bind  sine 


sind  cosd  +  cosd  cose  sind  sine  sind 
-sind  sind  +  cosd  cose  cosd  sine  cosd 
-cosd  sine  cose 


(3.7) 


This  then  provides  a  parameterization  in  three  dimensions.  The  parameterization  is 
unique  if,  for  example,  0  <  e  <  2n,  0  <  d  <  2 it,  and  0  <  d  «  w. 

(iv)  A  generalization  of  the  concept  of  the  three-dimensional  Eulerian  angles 
to  n  dimensions  was  given  by  Raffenetti  and  Ruedenberg  (1970).  Formulae  and  a 
computer  program  were  derived  that  expressed  an  arbitrary  orthogonal  matrix  of 
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order  n  x  n  in  terns  of  <jn(n-l)  angular  variables.  A  general  n -dimensional 
orthogonal  matrix,  Q ,  can  be  constructed  by  the  sequence  of  recurrence  relations 

s(x)  -  41 

^(i)  =  diagtQ*1*1-),  1}  ,  i  =  2,  3,  ,  n  , 

a(*-)  =  ai-i,iai-*,i  ***  al,i  »  i  ®  2,  3,  ,  n  i 

»  A(i>  q<i>  ,  i  »  2,  3 . n  , 

3  “3?D> 

where  (aitj)  (i  *  X*  2,  ...  ,  n;  j  =  1,  2,  ...  ,  nt  i  <  ])  Is  an  n  x  n  matrix 

whose  diagonal  elements  are  1  except  for  the  diagonal  elements  in  the  1th  and  j**1 

columns,  which  are  cosyij;  all  off-diagonal  elements  are  O  except  for  the  one 

corresponding  to  the  intersection  of  the  1th  row  and  the  j**1  column,  which  is  sinyy , 

and  that  an  the  intersection  of  the  1th  row  and  the  1th  columns  which  is  -sinyj}. 

The  choice  a  1  yields  an  orthogonal  matrix  Q  *  {Qy}  with  determinant  1,  and 

the  choice  Q<x)  =  -1  yields  a  matrix  Q  with  determinant  -1. 

Conversely,  if  an  orthogonal  matrix,  U,  =  {Cy},  is  given,  then  the 

corresponding  angular  variables  are  found  by  minimizing  the  function  f(y) 

**  E  [Oij  '  Qij(y>]*.  and  where  y  »  (yxx,  yL„  ...  ,  yn.ifn)'  (i,  j  *  1, 
i.J 

2,  ...  ,  n),  and  the  elements  of  Q  are  found  by  a  prescribed  recurrence 

%em 

scheme.  Hoffman  et  al  (1972),  however,  give  algebraic  inversion  formulae,  and  a 
computer  program,  for  the  angular  variables  in  terms  of  the  elements  of  the 
orthogonal  matrix. 

4.  A  Problem  Concerning  Baar  Measure. 

Xt  would  be  of  value  for  numerical  integration  over  the  orthogonal  group  if 
Haar  measure  could  be  generated  by  fcn(n-l)  statistically  independent  random  angles 
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each  uniformly  distributed.  For  we  could  then  approximate  the  integration  over  the 
orthogonal  group  by  a  discrete  sum  over  points  uniformly  spaced  on  circles.  In 
three  dimensions  the  Eulerian  angles  do  not  serve  this  purpose.  For  suppose  that 
e,  and  4)  are  each  uniformly  distributed.  Then  cose  cannot  have  the  same 
distribution  as  the  other  elements  on  the  diagonal  of  the  matrix  in  (3.7),  so  that 
the  transformation  is  not  symmetrical  with  respect  to  the  axes.  Thus  the  distribution 
cannot  be  invariant  under  all  rotations. 

One  can  arrive  at  Saar  measure  for  the  three-dimensional  rotation  group  by 
choosing  an  of  rotation  with  uniform  polar  angles  in  (0,  tr)  and  (o,  2ir)  and 
then  an  angle  u  of  rotation  with  uniform  distribution  in  (0,  2s).  The 

— ****  giving  this  orthogonal  transformation,  expressed  in  berms  of  u  and  the 
direction  cosines,  coax,  cosji,  and  cost/,  of  the  axis  of  rotation,  is 

1  -  2sfh*  2CjCj.lt*  -  c,s  2cxc,h*  +  czs 

2cxczh*  +  c,s  1  -  2s|h*  2czc,hz  -  cxs  ,  (4.1) 

2cxc,h*  -  cxs  2c2c3h*  +  CjS  1  -  2s^h* 

where  s  a  sinu,  h  »  sin%u,  sx  =  sinA,  a2  ■  sinji,  s3  *  sinw,  cx  ■  cosx,  c2  * 
cornu,  c,  »  coav .  This  result  can  be  readily  derived  from  Whittaker  (1927,  p.  a). 
Formula  (4.1)  provides  a  matrix  representation  of  the  rotation  group  and  it  is 
equipped  with  Hasr  measure  if  x,  ji,  v  and  u  have  uniform  distributions  in  (0,  2s), 
(O,  2s),  (o,  2s),  and  (0,  s)  respectively.  It  seems  to  be  difficult  to  generalize 
this  result  to  n  dimensions. 
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